Guan Huang

UniDriveDreamer: A Single-Stage Multimodal World Model for Autonomous Driving

Feb 02, 2026
Via arXiv

EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation

Sep 26, 2025
Via arXiv

EMMA: Generalizing Real-World Robot Manipulation via Generative Visual Transfer

Sep 26, 2025
Via arXiv

MimicDreamer: Aligning Human and Robot Demonstrations for Scalable VLA Training

Sep 26, 2025
Via arXiv

ReconDreamer-RL: Enhancing Reinforcement Learning via Diffusion-based Scene Reconstruction

Aug 11, 2025
Via arXiv

Humanoid Occupancy: Enabling A Generalized Multimodal Occupancy Perception System on Humanoid Robots

Jul 27, 2025
Via arXiv

WonderFree: Enhancing Novel View Quality and Cross-View Consistency for 3D Scene Exploration

Jun 25, 2025
Via arXiv

Motion-R1: Chain-of-Thought Reasoning and Reinforcement Learning for Human Motion Generation

Jun 12, 2025
Via arXiv

GigaVideo-1: Advancing Video Generation via Automatic Feedback with 4 GPU-Hours Fine-Tuning

Jun 12, 2025
Via arXiv

RoboTransfer: Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer

May 29, 2025
Via arXiv